Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 199146 |
| Missing cells | 39721 |
| Missing cells (%) | 1.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 108.0 MiB |
| Average record size in memory | 568.6 B |
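The percentage and per-record figures in this table can be recomputed from the raw counts. The sketch below is a rough check only. It assumes the missing-cell share is taken over rows × columns and that 1 MiB is 1,048,576 bytes; the variable names are illustrative and not part of the report. Because the reported total size is itself rounded, the recomputed record size matches the table only approximately.

```python
# Values copied from the "Dataset statistics" table.
n_variables = 16
n_observations = 199_146
missing_cells = 39_721
total_size_mib = 108.0

# Assumption: the missing-cell share is computed over all rows x columns.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: about {avg_record_bytes:.0f} B")
```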
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 7 |
Alerts
| Alert | Type |
|---|---|
| FORMULARIO has a high cardinality: 199146 distinct values | High cardinality |
| FECHA_OCURRENCIA_ACC has a high cardinality: 2445 distinct values | High cardinality |
| DIRECCION has a high cardinality: 100085 distinct values | High cardinality |
| FECHA_HORA_ACC has a high cardinality: 162457 distinct values | High cardinality |
| X is highly correlated with LONGITUD | High correlation |
| Y is highly correlated with LATITUD | High correlation |
| OBJECTID is highly correlated with CODIGO_ACCIDENTE and 1 other field | High correlation |
| CODIGO_ACCIDENTE is highly correlated with OBJECTID and 1 other field | High correlation |
| ANO_OCURRENCIA_ACC is highly correlated with OBJECTID and 1 other field | High correlation |
| LATITUD is highly correlated with Y | High correlation |
| LONGITUD is highly correlated with X | High correlation |
| CODIGO_ACCIDENTE is highly correlated with ANO_OCURRENCIA_ACC | High correlation |
| ANO_OCURRENCIA_ACC is highly correlated with CODIGO_ACCIDENTE | High correlation |
| X is highly correlated with Y and 4 other fields | High correlation |
| Y is highly correlated with X and 4 other fields | High correlation |
| LOCALIDAD is highly correlated with X and 4 other fields | High correlation |
| LATITUD is highly correlated with X and 4 other fields | High correlation |
| LONGITUD is highly correlated with X and 4 other fields | High correlation |
| CIV is highly correlated with X and 4 other fields | High correlation |
| PK_CALZADA has 37974 (19.1%) missing values | Missing |
| FORMULARIO is uniformly distributed | Uniform |
| FECHA_HORA_ACC is uniformly distributed | Uniform |
| OBJECTID has unique values | Unique |
| FORMULARIO has unique values | Unique |
| CODIGO_ACCIDENTE has unique values | Unique |
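The high-correlation alerts for pairs such as X and LONGITUD suggest that one column largely duplicates the other. As a minimal sketch of what such a flag measures, the code below computes a Pearson coefficient by hand. The paired rows are hypothetical toy values, the `pearson` helper is illustrative, and the 0.9 cut-off is an assumption rather than a setting confirmed by this report.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical toy rows: the pairing is invented to mimic a duplicated column.
x = [-74.103, -74.138, -74.112, -74.079]
longitud = [-74.103, -74.138, -74.112, -74.079]

THRESHOLD = 0.9  # assumed cut-off, for illustration only
r = pearson(x, longitud)
is_flagged = abs(r) > THRESHOLD
print(r, is_flagged)
```

When two columns are this redundant, keeping only one of them before modelling is usually enough.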
Reproduction
| Analysis started | 2022-05-20 01:24:36.781930 |
|---|---|
| Analysis finished | 2022-05-20 01:25:07.514404 |
| Duration | 30.73 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
X
Real number (ℝ)
| Distinct | 109453 |
|---|---|
| Distinct (%) | 55.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -74.10409559 |
| Minimum | -74.2283 |
| Maximum | -74.011 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 199146 |
| Negative (%) | 100.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | -74.2283 |
|---|---|
| 5-th percentile | -74.17267833 |
| Q1 | -74.13419566 |
| median | -74.10304278 |
| Q3 | -74.07300849 |
| 95-th percentile | -74.04185926 |
| Maximum | -74.011 |
| Range | 0.2173 |
| Interquartile range (IQR) | 0.06118717125 |
Descriptive statistics
| Standard deviation | 0.04009853015 |
|---|---|
| Coefficient of variation (CV) | -0.00054111085 |
| Kurtosis | -0.6560623801 |
| Mean | -74.10409559 |
| Median Absolute Deviation (MAD) | 0.030506752 |
| Skewness | -0.1595573156 |
| Sum | -14757534.22 |
| Variance | 0.00160789212 |
| Monotonicity | Not monotonic |
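Several rows in the two tables above are derived from others: the range from the extremes, the IQR from the quartiles, the variance from the standard deviation, and the coefficient of variation from the standard deviation and the mean. A short sketch using the reported values shows the relationships. The variable names are illustrative, and the CV is assumed to be the standard deviation divided by the mean, which is why it comes out negative here.

```python
import math

# Values copied from the quantile and descriptive statistics tables.
mean = -74.10409559
std = 0.04009853015
q1, q3 = -74.13419566, -74.07300849
minimum, maximum = -74.2283, -74.011

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range
variance = std ** 2              # Variance
cv = std / mean                  # Coefficient of variation (negative mean, negative CV)

print(value_range, iqr, variance, cv)
```

A CV this close to zero mainly reflects the large offset of the mean from zero, so it says little about the spread of coordinates within the city.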
| Value | Count | Frequency (%) |
| -74.103 | 254 | 0.1% |
| -74.138 | 244 | 0.1% |
| -74.112 | 221 | 0.1% |
| -74.139 | 216 | 0.1% |
| -74.15405306 | 206 | 0.1% |
| -74.079 | 195 | 0.1% |
| -74.08952915 | 194 | 0.1% |
| -74.1 | 189 | 0.1% |
| -74.084 | 184 | 0.1% |
| -74.11448906 | 176 | 0.1% |
| Other values (109443) | 197067 |
| Value | Count | Frequency (%) |
| -74.2283 | 1 | |
| -74.218 | 1 | |
| -74.2152414 | 1 | |
| -74.215 | 2 | |
| -74.21498272 | 1 | |
| -74.21495012 | 1 | |
| -74.21492074 | 1 | |
| -74.21477615 | 1 | |
| -74.2147 | 1 | |
| -74.2146 | 1 |
| Value | Count | Frequency (%) |
| -74.011 | 1 | |
| -74.013 | 2 | |
| -74.013247 | 1 | |
| -74.0139 | 1 | |
| -74.01391874 | 1 | |
| -74.01398528 | 1 | |
| -74.014 | 1 | |
| -74.01405288 | 1 | |
| -74.01447867 | 1 | |
| -74.01461162 | 1 |
Y
Real number (ℝ≥0)
| Distinct | 110417 |
|---|---|
| Distinct (%) | 55.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.649135264 |
| Minimum | 4.0858 |
| Maximum | 4.828040683 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 4.0858 |
|---|---|
| 5-th percentile | 4.559559054 |
| Q1 | 4.608163175 |
| median | 4.645415342 |
| Q3 | 4.690015397 |
| 95-th percentile | 4.746763261 |
| Maximum | 4.828040683 |
| Range | 0.742240683 |
| Interquartile range (IQR) | 0.081852222 |
Descriptive statistics
| Standard deviation | 0.05756745119 |
|---|---|
| Coefficient of variation (CV) | 0.01238239972 |
| Kurtosis | -0.2474953999 |
| Mean | 4.649135264 |
| Median Absolute Deviation (MAD) | 0.0412219065 |
| Skewness | 0.01773892237 |
| Sum | 925856.6914 |
| Variance | 0.003314011436 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4.597 | 355 | 0.2% |
| 4.632361373 | 206 | 0.1% |
| 4.695739417 | 194 | 0.1% |
| 4.63 | 186 | 0.1% |
| 4.615541137 | 176 | 0.1% |
| 4.627 | 167 | 0.1% |
| 4.627151357 | 155 | 0.1% |
| 4.751609833 | 155 | 0.1% |
| 4.628 | 154 | 0.1% |
| 4.629 | 153 | 0.1% |
| Other values (110407) | 197245 |
| Value | Count | Frequency (%) |
| 4.0858 | 1 | |
| 4.191248408 | 1 | |
| 4.3031 | 1 | |
| 4.372 | 1 | |
| 4.382 | 1 | |
| 4.385 | 1 | |
| 4.386 | 1 | |
| 4.387 | 1 | |
| 4.388244867 | 1 | |
| 4.391 | 1 |
| Value | Count | Frequency (%) |
| 4.828040683 | 1 | < 0.1% |
| 4.825 | 2 | < 0.1% |
| 4.82442932 | 1 | < 0.1% |
| 4.823752166 | 1 | < 0.1% |
| 4.823154592 | 1 | < 0.1% |
| 4.821924473 | 1 | < 0.1% |
| 4.821 | 1 | < 0.1% |
| 4.8208 | 1 | < 0.1% |
| 4.820762417 | 12 | |
| 4.820694877 | 1 | < 0.1% |
OBJECTID
Real number (ℝ≥0)
| Distinct | 199146 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 246116.2682 |
| Minimum | 1 |
| Maximum | 421911 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 17822.5 |
| Q1 | 135603.75 |
| median | 283027.5 |
| Q3 | 358176.75 |
| 95-th percentile | 411953.75 |
| Maximum | 421911 |
| Range | 421910 |
| Interquartile range (IQR) | 222573 |
Descriptive statistics
| Standard deviation | 128297.6317 |
|---|---|
| Coefficient of variation (CV) | 0.5212887092 |
| Kurtosis | -1.136518307 |
| Mean | 246116.2682 |
| Median Absolute Deviation (MAD) | 93717 |
| Skewness | -0.4386851747 |
| Sum | 4.901307034 × 10^10 |
| Variance | 1.646028231 × 10^10 |
| Monotonicity | Strictly increasing |
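The monotonicity row describes the values in row order, not in sorted order, so "Strictly increasing" here means each record has a larger value than the one before it. A minimal sketch of such a check follows. The `monotonicity` helper and its label strings are illustrative assumptions, and the sample list reuses the ten smallest values reported for this variable.

```python
def monotonicity(values):
    """Classify a sequence by comparing each element with its successor."""
    pairs = list(zip(values, values[1:]))
    if all(a < b for a, b in pairs):
        return "Strictly increasing"
    if all(a > b for a, b in pairs):
        return "Strictly decreasing"
    if all(a <= b for a, b in pairs):
        return "Increasing"
    if all(a >= b for a, b in pairs):
        return "Decreasing"
    return "Not monotonic"

# The ten smallest values listed for this variable, in ascending order.
smallest_ids = [1, 2, 4, 7, 8, 9, 10, 12, 13, 15]
print(monotonicity(smallest_ids))
```

The gaps in the sequence (3, 5, 6 and so on are absent) do not affect the result, since only the ordering matters.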
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 341113 | 1 | < 0.1% |
| 341091 | 1 | < 0.1% |
| 341092 | 1 | < 0.1% |
| 341093 | 1 | < 0.1% |
| 341094 | 1 | < 0.1% |
| 341095 | 1 | < 0.1% |
| 341096 | 1 | < 0.1% |
| 341097 | 1 | < 0.1% |
| 341098 | 1 | < 0.1% |
| Other values (199136) | 199136 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 4 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 12 | 1 | |
| 13 | 1 | |
| 15 | 1 |
| Value | Count | Frequency (%) |
| 421911 | 1 | |
| 421910 | 1 | |
| 421909 | 1 | |
| 421908 | 1 | |
| 421907 | 1 | |
| 421906 | 1 | |
| 421905 | 1 | |
| 421904 | 1 | |
| 421903 | 1 | |
| 421902 | 1 |
FORMULARIO
Categorical
| Distinct | 199146 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.6 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 9.560382835 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1903912 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 199146 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | A000640275 |
|---|---|
| 2nd row | A001233353 |
| 3rd row | A001232786 |
| 4th row | A000200705 |
| 5th row | A000402862 |
Common Values
| Value | Count | Frequency (%) |
| A000640275 | 1 | < 0.1% |
| A001057677 | 1 | < 0.1% |
| A000872644 | 1 | < 0.1% |
| A001303945 | 1 | < 0.1% |
| A001299396 | 1 | < 0.1% |
| A001297968 | 1 | < 0.1% |
| A001238387 | 1 | < 0.1% |
| A001058034 | 1 | < 0.1% |
| A001057979 | 1 | < 0.1% |
| A001058515 | 1 | < 0.1% |
| Other values (199136) | 199136 |
Words
| Value | Count | Frequency (%) |
| a000640275 | 1 | < 0.1% |
| a000816158 | 1 | < 0.1% |
| a000690468 | 1 | < 0.1% |
| a001235038 | 1 | < 0.1% |
| a001185526 | 1 | < 0.1% |
| a001183932 | 1 | < 0.1% |
| a001180302 | 1 | < 0.1% |
| a001232786 | 1 | < 0.1% |
| a000200705 | 1 | < 0.1% |
| a000402862 | 1 | < 0.1% |
| Other values (199136) | 199136 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 612727 | |
| A | 199146 | 10.5% |
| 1 | 181332 | 9.5% |
| 6 | 123831 | 6.5% |
| 3 | 119791 | 6.3% |
| 4 | 117287 | 6.2% |
| 2 | 115104 | 6.0% |
| 9 | 112019 | 5.9% |
| 7 | 111080 | 5.8% |
| 8 | 108141 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1704766 | |
| Uppercase Letter | 199146 | 10.5% |
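The category names in this table correspond to Unicode general categories: "Decimal Number" is `Nd` and "Uppercase Letter" is `Lu`. As a small illustration, assuming the report classifies each character this way, the sketch below counts categories with the standard-library `unicodedata` module over the five sample identifiers shown earlier. It is not the report's own implementation.

```python
import unicodedata
from collections import Counter

# The five values from the "Sample" table for this variable.
sample_ids = ["A000640275", "A001233353", "A001232786", "A000200705", "A000402862"]

# Count the Unicode general category of every character in every value.
category_counts = Counter(
    unicodedata.category(ch) for value in sample_ids for ch in value
)

# Readable names for the two categories that occur here.
names = {"Nd": "Decimal Number", "Lu": "Uppercase Letter"}
for code, count in category_counts.most_common():
    print(names.get(code, code), count)
```

Each identifier in the sample contributes one uppercase letter and nine digits, which fits the roughly 10% share of uppercase letters reported for the full column.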
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 612727 | |
| 1 | 181332 | 10.6% |
| 6 | 123831 | 7.3% |
| 3 | 119791 | 7.0% |
| 4 | 117287 | 6.9% |
| 2 | 115104 | 6.8% |
| 9 | 112019 | 6.6% |
| 7 | 111080 | 6.5% |
| 8 | 108141 | 6.3% |
| 5 | 103454 | 6.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 199146 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1704766 | |
| Latin | 199146 | 10.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 612727 | |
| 1 | 181332 | 10.6% |
| 6 | 123831 | 7.3% |
| 3 | 119791 | 7.0% |
| 4 | 117287 | 6.9% |
| 2 | 115104 | 6.8% |
| 9 | 112019 | 6.6% |
| 7 | 111080 | 6.5% |
| 8 | 108141 | 6.3% |
| 5 | 103454 | 6.1% |
Latin
| Value | Count | Frequency (%) |
| A | 199146 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1903912 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 612727 | |
| A | 199146 | 10.5% |
| 1 | 181332 | 9.5% |
| 6 | 123831 | 6.5% |
| 3 | 119791 | 6.3% |
| 4 | 117287 | 6.2% |
| 2 | 115104 | 6.0% |
| 9 | 112019 | 5.9% |
| 7 | 111080 | 5.8% |
| 8 | 108141 | 5.7% |
CODIGO_ACCIDENTE
Real number (ℝ≥0)
HIGH CORRELATION, UNIQUE
| Distinct | 199146 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7368237.845 |
| Minimum | 4401420 |
| Maximum | 10549255 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 4401420 |
|---|---|
| 5-th percentile | 4412974.25 |
| Q1 | 4458053.25 |
| median | 4512847.5 |
| Q3 | 10497938.5 |
| 95-th percentile | 10539216.75 |
| Maximum | 10549255 |
| Range | 6147835 |
| Interquartile range (IQR) | 6039885.25 |
Descriptive statistics
| Standard deviation | 3017820.345 |
|---|---|
| Coefficient of variation (CV) | 0.409571516 |
| Kurtosis | -1.994164264 |
| Mean | 7368237.845 |
| Median Absolute Deviation (MAD) | 107110 |
| Skewness | 0.0736155348 |
| Sum | 1.467355094 × 10^12 |
| Variance | 9.107239632 × 10^12 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4484660 | 1 | < 0.1% |
| 10501475 | 1 | < 0.1% |
| 10462245 | 1 | < 0.1% |
| 10540728 | 1 | < 0.1% |
| 10539860 | 1 | < 0.1% |
| 10538352 | 1 | < 0.1% |
| 10535716 | 1 | < 0.1% |
| 10501434 | 1 | < 0.1% |
| 10501435 | 1 | < 0.1% |
| 10501438 | 1 | < 0.1% |
| Other values (199136) | 199136 |
| Value | Count | Frequency (%) |
| 4401420 | 1 | |
| 4401421 | 1 | |
| 4401422 | 1 | |
| 4401423 | 1 | |
| 4401424 | 1 | |
| 4401425 | 1 | |
| 4401426 | 1 | |
| 4401428 | 1 | |
| 4401429 | 1 | |
| 4401430 | 1 |
| Value | Count | Frequency (%) |
| 10549255 | 1 | |
| 10549254 | 1 | |
| 10549253 | 1 | |
| 10549252 | 1 | |
| 10549251 | 1 | |
| 10549250 | 1 | |
| 10549249 | 1 | |
| 10549248 | 1 | |
| 10549247 | 1 | |
| 10549246 | 1 |
FECHA_OCURRENCIA_ACC
Categorical
| Distinct | 2445 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.0 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 22 |
| Min length | 22 |
Characters and Unicode
| Total characters | 4381212 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2017/06/12 00:00:00+00 |
|---|---|
| 2nd row | 2020/11/19 00:00:00+00 |
| 3rd row | 2020/11/10 00:00:00+00 |
| 4th row | 2015/05/11 00:00:00+00 |
| 5th row | 2016/06/08 00:00:00+00 |
Common Values
| Value | Count | Frequency (%) |
| 2019/12/06 00:00:00+00 | 155 | 0.1% |
| 2016/11/08 00:00:00+00 | 147 | 0.1% |
| 2017/10/27 00:00:00+00 | 143 | 0.1% |
| 2018/05/11 00:00:00+00 | 141 | 0.1% |
| 2018/06/08 00:00:00+00 | 141 | 0.1% |
| 2016/11/15 00:00:00+00 | 139 | 0.1% |
| 2016/10/13 00:00:00+00 | 137 | 0.1% |
| 2018/03/09 00:00:00+00 | 137 | 0.1% |
| 2019/11/30 00:00:00+00 | 136 | 0.1% |
| 2020/02/01 00:00:00+00 | 134 | 0.1% |
| Other values (2435) | 197736 |
Words
| Value | Count | Frequency (%) |
| 00:00:00+00 | 199146 | |
| 2019/12/06 | 155 | < 0.1% |
| 2016/11/08 | 147 | < 0.1% |
| 2017/10/27 | 143 | < 0.1% |
| 2018/05/11 | 141 | < 0.1% |
| 2018/06/08 | 141 | < 0.1% |
| 2016/11/15 | 139 | < 0.1% |
| 2016/10/13 | 137 | < 0.1% |
| 2018/03/09 | 137 | < 0.1% |
| 2019/11/30 | 136 | < 0.1% |
| Other values (2436) | 197870 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2059388 | |
| / | 398292 | 9.1% |
| : | 398292 | 9.1% |
| 2 | 357280 | 8.2% |
| 1 | 344233 | 7.9% |
| (space) | 199146 | 4.5% |
| + | 199146 | 4.5% |
| 8 | 71091 | 1.6% |
| 7 | 69630 | 1.6% |
| 9 | 68920 | 1.6% |
| Other values (4) | 215794 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3186336 | |
| Other Punctuation | 796584 | 18.2% |
| Space Separator | 199146 | 4.5% |
| Math Symbol | 199146 | 4.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2059388 | |
| 2 | 357280 | 11.2% |
| 1 | 344233 | 10.8% |
| 8 | 71091 | 2.2% |
| 7 | 69630 | 2.2% |
| 9 | 68920 | 2.2% |
| 6 | 68422 | 2.1% |
| 5 | 64749 | 2.0% |
| 3 | 47749 | 1.5% |
| 4 | 34874 | 1.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 398292 | |
| : | 398292 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 199146 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 199146 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4381212 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2059388 | |
| / | 398292 | 9.1% |
| : | 398292 | 9.1% |
| 2 | 357280 | 8.2% |
| 1 | 344233 | 7.9% |
| (space) | 199146 | 4.5% |
| + | 199146 | 4.5% |
| 8 | 71091 | 1.6% |
| 7 | 69630 | 1.6% |
| 9 | 68920 | 1.6% |
| Other values (4) | 215794 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4381212 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2059388 | |
| / | 398292 | 9.1% |
| : | 398292 | 9.1% |
| 2 | 357280 | 8.2% |
| 1 | 344233 | 7.9% |
| (space) | 199146 | 4.5% |
| + | 199146 | 4.5% |
| 8 | 71091 | 1.6% |
| 7 | 69630 | 1.6% |
| 9 | 68920 | 1.6% |
| Other values (4) | 215794 | 4.9% |
ANO_OCURRENCIA_ACC
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2017.760106 |
| Minimum | 2015 |
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 2015 |
|---|---|
| 5-th percentile | 2015 |
| Q1 | 2016 |
| median | 2018 |
| Q3 | 2019 |
| 95-th percentile | 2021 |
| Maximum | 2021 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.855070873 |
|---|---|
| Coefficient of variation (CV) | 0.00091937137 |
| Kurtosis | -1.060527733 |
| Mean | 2017.760106 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.1232263737 |
| Sum | 401828854 |
| Variance | 3.441287943 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2018 | 33418 | |
| 2019 | 32962 | |
| 2017 | 32415 | |
| 2016 | 31928 | |
| 2015 | 27885 | |
| 2020 | 22424 | |
| 2021 | 18114 |
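The frequency column of this table is empty in this copy of the report, but the shares follow directly from the counts. The sketch below recomputes them, along with the sum and mean listed under the descriptive statistics. The dictionary and variable names are illustrative; the counts are copied from the table.

```python
# Record counts per year, copied from the table above.
year_counts = {
    2015: 27885,
    2016: 31928,
    2017: 32415,
    2018: 33418,
    2019: 32962,
    2020: 22424,
    2021: 18114,
}

total = sum(year_counts.values())
shares = {year: 100 * count / total for year, count in year_counts.items()}

# The reported "Sum" and "Mean" are the count-weighted sum and average of the years.
weighted_sum = sum(year * count for year, count in year_counts.items())
mean_year = weighted_sum / total

for year in sorted(shares):
    print(year, f"{shares[year]:.1f}%")
print(total, weighted_sum, round(mean_year, 6))
```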
| Value | Count | Frequency (%) |
| 2015 | 27885 | |
| 2016 | 31928 | |
| 2017 | 32415 | |
| 2018 | 33418 | |
| 2019 | 32962 | |
| 2020 | 22424 | |
| 2021 | 18114 |
| Value | Count | Frequency (%) |
| 2021 | 18114 | |
| 2020 | 22424 | |
| 2019 | 32962 | |
| 2018 | 33418 | |
| 2017 | 32415 | |
| 2016 | 31928 | |
| 2015 | 27885 |
DIRECCION
Categorical
| Distinct | 100085 |
|---|---|
| Distinct (%) | 50.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.3 MiB |
Length
| Max length | 73 |
|---|---|
| Median length | 69 |
| Mean length | 18.10139797 |
| Min length | 11 |
Characters and Unicode
| Total characters | 3604821 |
|---|---|
| Distinct characters | 45 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 73029 |
|---|---|
| Unique (%) | 36.7% |
Sample
| 1st row | AV AVENIDA BOYACA-CL 79 02 |
|---|---|
| 2nd row | CL 26 S- KR 50 02 |
| 3rd row | KR 9 - CL 100 02 |
| 4th row | CL 63A-KR 72 S 02 |
| 5th row | KR 27-CL 9 14 |
Common Values
| Value | Count | Frequency (%) |
| KR 80-CL 2 51 | 213 | 0.1% |
| AV AVENIDA BOYACA-CL 80 02 | 153 | 0.1% |
| CL 13-KR 72 02 | 149 | 0.1% |
| CL 100-KR 15 02 | 144 | 0.1% |
| AV AVENIDA CIUDAD DE CALI-CL 26 02 | 140 | 0.1% |
| AV AVENIDA BOYACA-CL 26 02 | 138 | 0.1% |
| CL 80-KR 72 02 | 135 | 0.1% |
| KR 50-CL 3 02 | 132 | 0.1% |
| AV AVENIDA DEL SUR-CL 59 S 02 | 124 | 0.1% |
| AV AVENIDA DE LAS AMERICAS-KR 68 02 | 118 | 0.1% |
| Other values (100075) | 197700 |
Words
| Value | Count | Frequency (%) |
| 02 | 114483 | 12.0% |
| cl | 80693 | 8.4% |
| kr | 78958 | 8.3% |
| 2 | 51145 | 5.3% |
| s | 46375 | 4.8% |
| avenida | 34828 | 3.6% |
| av | 33863 | 3.5% |
| | 15546 | 1.6% |
| de | 14621 | 1.5% |
| ak | 9188 | 1.0% |
| Other values (6162) | 476973 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 773557 | |
| 2 | 254937 | 7.1% |
| C | 236570 | 6.6% |
| A | 226401 | 6.3% |
| - | 199142 | 5.5% |
| L | 193752 | 5.4% |
| R | 185472 | 5.1% |
| 0 | 183022 | 5.1% |
| K | 169706 | 4.7% |
| 1 | 145763 | 4.0% |
| Other values (35) | 1036499 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1541642 | |
| Decimal Number | 1090368 | |
| Space Separator | 773557 | |
| Dash Punctuation | 199142 | 5.5% |
| Lowercase Letter | 112 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 236570 | |
| A | 226401 | |
| L | 193752 | |
| R | 185472 | |
| K | 169706 | |
| D | 79985 | 5.2% |
| V | 74898 | 4.9% |
| S | 71092 | 4.6% |
| I | 68661 | 4.5% |
| E | 67623 | 4.4% |
| Other values (16) | 167482 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 254937 | |
| 0 | 183022 | |
| 1 | 145763 | |
| 6 | 83334 | 7.6% |
| 7 | 81668 | 7.5% |
| 3 | 78141 | 7.2% |
| 5 | 72200 | 6.6% |
| 8 | 69284 | 6.4% |
| 4 | 68782 | 6.3% |
| 9 | 53237 | 4.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 44 | |
| n | 22 | |
| u | 22 | |
| a | 9 | 8.0% |
| c | 7 | 6.2% |
| b | 6 | 5.4% |
| f | 2 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 773557 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 199142 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2063067 | |
| Latin | 1541754 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 236570 | |
| A | 226401 | |
| L | 193752 | |
| R | 185472 | |
| K | 169706 | |
| D | 79985 | 5.2% |
| V | 74898 | 4.9% |
| S | 71092 | 4.6% |
| I | 68661 | 4.5% |
| E | 67623 | 4.4% |
| Other values (23) | 167594 |
Common
| Value | Count | Frequency (%) |
| (space) | 773557 | |
| 2 | 254937 | 12.4% |
| - | 199142 | 9.7% |
| 0 | 183022 | 8.9% |
| 1 | 145763 | 7.1% |
| 6 | 83334 | 4.0% |
| 7 | 81668 | 4.0% |
| 3 | 78141 | 3.8% |
| 5 | 72200 | 3.5% |
| 8 | 69284 | 3.4% |
| Other values (2) | 122019 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3604821 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 773557 | |
| 2 | 254937 | 7.1% |
| C | 236570 | 6.6% |
| A | 226401 | 6.3% |
| - | 199142 | 5.5% |
| L | 193752 | 5.4% |
| R | 185472 | 5.1% |
| 0 | 183022 | 5.1% |
| K | 169706 | 4.7% |
| 1 | 145763 | 4.0% |
| Other values (35) | 1036499 |
GRAVEDAD
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.8 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 10.35621604 |
| Min length | 10 |
Characters and Unicode
| Total characters | 2062399 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SOLO DANOS |
|---|---|
| 2nd row | CON HERIDOS |
| 3rd row | SOLO DANOS |
| 4th row | SOLO DANOS |
| 5th row | SOLO DANOS |
Common Values
| Value | Count | Frequency (%) |
| SOLO DANOS | 128207 | |
| CON HERIDOS | 67700 | |
| CON MUERTOS | 3239 | 1.6% |
Words
| Value | Count | Frequency (%) |
| solo | 128207 | |
| danos | 128207 | |
| con | 70939 | |
| heridos | 67700 | |
| muertos | 3239 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 526499 | |
| S | 327353 | |
| (space) | 199146 | 9.7% |
| N | 199146 | 9.7% |
| D | 195907 | 9.5% |
| L | 128207 | 6.2% |
| A | 128207 | 6.2% |
| C | 70939 | 3.4% |
| E | 70939 | 3.4% |
| R | 70939 | 3.4% |
| Other values (5) | 145117 | 7.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1863253 | |
| Space Separator | 199146 | 9.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 526499 | |
| S | 327353 | |
| N | 199146 | 10.7% |
| D | 195907 | 10.5% |
| L | 128207 | 6.9% |
| A | 128207 | 6.9% |
| C | 70939 | 3.8% |
| E | 70939 | 3.8% |
| R | 70939 | 3.8% |
| H | 67700 | 3.6% |
| Other values (4) | 77417 | 4.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 199146 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1863253 | |
| Common | 199146 | 9.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 526499 | |
| S | 327353 | |
| N | 199146 | 10.7% |
| D | 195907 | 10.5% |
| L | 128207 | 6.9% |
| A | 128207 | 6.9% |
| C | 70939 | 3.8% |
| E | 70939 | 3.8% |
| R | 70939 | 3.8% |
| H | 67700 | 3.6% |
| Other values (4) | 77417 | 4.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 199146 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2062399 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| O | 526499 | |
| S | 327353 | |
| (space) | 199146 | 9.7% |
| N | 199146 | 9.7% |
| D | 195907 | 9.5% |
| L | 128207 | 6.2% |
| A | 128207 | 6.2% |
| C | 70939 | 3.4% |
| E | 70939 | 3.4% |
| R | 70939 | 3.4% |
| Other values (5) | 145117 | 7.0% |
CLASE_ACC
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.1 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 6 |
| Mean length | 6.62048949 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1318444 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CHOQUE |
|---|---|
| 2nd row | OTRO |
| 3rd row | CHOQUE |
| 4th row | CHOQUE |
| 5th row | CHOQUE |
Common Values
| Value | Count | Frequency (%) |
| CHOQUE | 170802 | |
| ATROPELLO | 20138 | 10.1% |
| CAIDA DE OCUPANTE | 4639 | 2.3% |
| VOLCAMIENTO | 2729 | 1.4% |
| OTRO | 804 | 0.4% |
| INCENDIO | 24 | < 0.1% |
| AUTOLESION | 10 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| choque | 170802 | |
| atropello | 20138 | 9.7% |
| caida | 4639 | 2.2% |
| de | 4639 | 2.2% |
| ocupante | 4639 | 2.2% |
| volcamiento | 2729 | 1.3% |
| otro | 804 | 0.4% |
| incendio | 24 | < 0.1% |
| autolesion | 10 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 222827 | |
| E | 202981 | |
| C | 182833 | |
| U | 175451 | |
| H | 170802 | |
| Q | 170802 | |
| L | 43015 | 3.3% |
| A | 36794 | 2.8% |
| T | 28320 | 2.1% |
| P | 24777 | 1.9% |
| Other values (8) | 59842 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1309166 | |
| Space Separator | 9278 | 0.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 222827 | |
| E | 202981 | |
| C | 182833 | |
| U | 175451 | |
| H | 170802 | |
| Q | 170802 | |
| L | 43015 | 3.3% |
| A | 36794 | 2.8% |
| T | 28320 | 2.2% |
| P | 24777 | 1.9% |
| Other values (7) | 50564 | 3.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9278 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1309166 | |
| Common | 9278 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 222827 | |
| E | 202981 | |
| C | 182833 | |
| U | 175451 | |
| H | 170802 | |
| Q | 170802 | |
| L | 43015 | 3.3% |
| A | 36794 | 2.8% |
| T | 28320 | 2.2% |
| P | 24777 | 1.9% |
| Other values (7) | 50564 | 3.9% |
Common
| Value | Count | Frequency (%) |
| (space) | 9278 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1318444 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| O | 222827 | |
| E | 202981 | |
| C | 182833 | |
| U | 175451 | |
| H | 170802 | |
| Q | 170802 | |
| L | 43015 | 3.3% |
| A | 36794 | 2.8% |
| T | 28320 | 2.1% |
| P | 24777 | 1.9% |
| Other values (8) | 59842 | 4.5% |
LOCALIDAD
Categorical
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 46 |
| Missing (%) | < 0.1% |
| Memory size | 12.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 13 |
| Mean length | 8.947840281 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1781515 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ENGATIVA |
|---|---|
| 2nd row | PUENTE ARANDA |
| 3rd row | USAQUEN |
| 4th row | CIUDAD BOLIVAR |
| 5th row | LOS MARTIRES |
Common Values
| Value | Count | Frequency (%) |
| KENNEDY | 23661 | |
| ENGATIVA | 20928 | |
| USAQUEN | 19292 | |
| SUBA | 18973 | |
| FONTIBON | 16377 | 8.2% |
| PUENTE ARANDA | 14143 | 7.1% |
| CHAPINERO | 11696 | 5.9% |
| TEUSAQUILLO | 10167 | 5.1% |
| BARRIOS UNIDOS | 10094 | 5.1% |
| BOSA | 9417 | 4.7% |
| Other values (10) | 44352 |
Words
| Value | Count | Frequency (%) |
| kennedy | 23661 | 9.0% |
| engativa | 20928 | 8.0% |
| usaquen | 19292 | 7.3% |
| suba | 18973 | 7.2% |
| fontibon | 16377 | 6.2% |
| puente | 14143 | 5.4% |
| aranda | 14143 | 5.4% |
| chapinero | 11696 | 4.5% |
| uribe | 10666 | 4.1% |
| teusaquillo | 10167 | 3.9% |
| Other values (18) | 102719 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 228524 | |
| N | 202275 | |
| E | 175641 | |
| U | 135525 | 7.6% |
| I | 131292 | 7.4% |
| O | 120326 | 6.8% |
| S | 110682 | 6.2% |
| T | 93074 | 5.2% |
| R | 92469 | 5.2% |
| B | 78867 | 4.4% |
| Other values (15) | 412840 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1717850 | |
| Space Separator | 63665 | 3.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 228524 | |
| N | 202275 | |
| E | 175641 | |
| U | 135525 | 7.9% |
| I | 131292 | 7.6% |
| O | 120326 | 7.0% |
| S | 110682 | 6.4% |
| T | 93074 | 5.4% |
| R | 92469 | 5.4% |
| B | 78867 | 4.6% |
| Other values (14) | 349175 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 63665 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1717850 | |
| Common | 63665 | 3.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 228524 | |
| N | 202275 | |
| E | 175641 | |
| U | 135525 | 7.9% |
| I | 131292 | 7.6% |
| O | 120326 | 7.0% |
| S | 110682 | 6.4% |
| T | 93074 | 5.4% |
| R | 92469 | 5.4% |
| B | 78867 | 4.6% |
| Other values (14) | 349175 |
Common
| Value | Count | Frequency (%) |
| (space) | 63665 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1781515 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 228524 | |
| N | 202275 | |
| E | 175641 | |
| U | 135525 | 7.6% |
| I | 131292 | 7.4% |
| O | 120326 | 6.8% |
| S | 110682 | 6.2% |
| T | 93074 | 5.2% |
| R | 92469 | 5.2% |
| B | 78867 | 4.4% |
| Other values (15) | 412840 |
FECHA_HORA_ACC
Categorical
| Distinct | 162457 |
|---|---|
| Distinct (%) | 81.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.0 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 22 |
| Min length | 22 |
Characters and Unicode
| Total characters | 4381212 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 133857 |
|---|---|
| Unique (%) | 67.2% |
Sample
| 1st row | 2017/06/12 05:30:00+00 |
|---|---|
| 2nd row | 2020/11/19 02:05:00+00 |
| 3rd row | 2020/11/10 13:30:00+00 |
| 4th row | 2015/05/11 10:50:00+00 |
| 5th row | 2016/06/08 21:30:00+00 |
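The sample values are stored as text, which is why this variable is profiled as categorical rather than as a date. A minimal sketch of converting one value to a timezone-aware `datetime` with the standard library is shown below. The helper name `parse_report_timestamp` is hypothetical, and it assumes every value ends in the bare `+00` offset seen in the samples. Padding the offset to `+0000` is a precaution, since `%z` may not accept an offset without minutes. Whether these timestamps are really UTC or local time labeled as UTC is not something the report confirms.

```python
from datetime import datetime, timezone

def parse_report_timestamp(text):
    """Parse a value such as '2017/06/12 05:30:00+00' (assumed format)."""
    # Pad the two-digit offset "+00" to "+0000" so that %z can read it.
    return datetime.strptime(text + "00", "%Y/%m/%d %H:%M:%S%z")

# First value from the "Sample" table.
ts = parse_report_timestamp("2017/06/12 05:30:00+00")
print(ts.isoformat())
print(ts.hour, ts.weekday())  # hour of day and weekday index (Monday is 0)
```

Once parsed, fields such as the hour or weekday can be analysed as ordinary numeric or low-cardinality features instead of 162457 distinct strings.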
Common Values
| Value | Count | Frequency (%) |
| 2018/01/29 13:00:00+00 | 7 | < 0.1% |
| 2016/12/03 14:00:00+00 | 7 | < 0.1% |
| 2016/03/08 12:30:00+00 | 7 | < 0.1% |
| 2016/09/13 15:00:00+00 | 7 | < 0.1% |
| 2016/06/07 07:00:00+00 | 6 | < 0.1% |
| 2017/02/17 15:00:00+00 | 6 | < 0.1% |
| 2016/11/21 11:00:00+00 | 6 | < 0.1% |
| 2018/03/21 14:00:00+00 | 6 | < 0.1% |
| 2017/07/15 14:00:00+00 | 6 | < 0.1% |
| 2016/10/12 14:00:00+00 | 6 | < 0.1% |
| Other values (162447) | 199082 |
Length
| Value | Count | Frequency (%) |
| 14:00:00+00 | 2487 | 0.6% |
| 13:00:00+00 | 2410 | 0.6% |
| 15:00:00+00 | 2375 | 0.6% |
| 14:30:00+00 | 2346 | 0.6% |
| 13:30:00+00 | 2269 | 0.6% |
| 11:00:00+00 | 2261 | 0.6% |
| 12:30:00+00 | 2259 | 0.6% |
| 16:00:00+00 | 2220 | 0.6% |
| 15:30:00+00 | 2194 | 0.6% |
| 08:00:00+00 | 2186 | 0.5% |
| Other values (3874) | 375285 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1524738 | |
| 1 | 513274 | 11.7% |
| 2 | 429592 | 9.8% |
| / | 398292 | 9.1% |
| : | 398292 | 9.1% |
| (space) | 199146 | 4.5% |
| + | 199146 | 4.5% |
| 5 | 139411 | 3.2% |
| 3 | 114185 | 2.6% |
| 8 | 96801 | 2.2% |
| Other values (4) | 368335 | 8.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3186336 | |
| Other Punctuation | 796584 | 18.2% |
| Space Separator | 199146 | 4.5% |
| Math Symbol | 199146 | 4.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1524738 | |
| 1 | 513274 | 16.1% |
| 2 | 429592 | 13.5% |
| 5 | 139411 | 4.4% |
| 3 | 114185 | 3.6% |
| 8 | 96801 | 3.0% |
| 7 | 96422 | 3.0% |
| 6 | 93478 | 2.9% |
| 9 | 93162 | 2.9% |
| 4 | 85273 | 2.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 398292 | |
| : | 398292 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 199146 | |
Math Symbol
| Value | Count | Frequency (%) |
| + | 199146 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4381212 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1524738 | |
| 1 | 513274 | 11.7% |
| 2 | 429592 | 9.8% |
| / | 398292 | 9.1% |
| : | 398292 | 9.1% |
| (space) | 199146 | 4.5% |
| + | 199146 | 4.5% |
| 5 | 139411 | 3.2% |
| 3 | 114185 | 2.6% |
| 8 | 96801 | 2.2% |
| Other values (4) | 368335 | 8.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4381212 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1524738 | |
| 1 | 513274 | 11.7% |
| 2 | 429592 | 9.8% |
| / | 398292 | 9.1% |
| : | 398292 | 9.1% |
| (space) | 199146 | 4.5% |
| + | 199146 | 4.5% |
| 5 | 139411 | 3.2% |
| 3 | 114185 | 2.6% |
| 8 | 96801 | 2.2% |
| Other values (4) | 368335 | 8.4% |
| Distinct | 108427 |
|---|---|
| Distinct (%) | 54.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.649135262 |
| Minimum | 4.0858 |
|---|---|
| Maximum | 4.82804068 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 4.0858 |
|---|---|
| 5-th percentile | 4.55955905 |
| Q1 | 4.60816318 |
| median | 4.64541534 |
| Q3 | 4.6900154 |
| 95-th percentile | 4.746763258 |
| Maximum | 4.82804068 |
| Range | 0.74224068 |
| Interquartile range (IQR) | 0.08185222 |
Descriptive statistics
| Standard deviation | 0.05756745196 |
|---|---|
| Coefficient of variation (CV) | 0.01238239989 |
| Kurtosis | -0.2474955365 |
| Mean | 4.649135262 |
| Median Absolute Deviation (MAD) | 0.041221905 |
| Skewness | 0.01773905907 |
| Sum | 925856.6909 |
| Variance | 0.003314011525 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4.597 | 355 | 0.2% |
| 4.63236137 | 211 | 0.1% |
| 4.69573942 | 200 | 0.1% |
| 4.63 | 186 | 0.1% |
| 4.61554114 | 179 | 0.1% |
| 4.627 | 167 | 0.1% |
| 4.75160983 | 167 | 0.1% |
| 4.631 | 166 | 0.1% |
| 4.62715136 | 160 | 0.1% |
| 4.628 | 154 | 0.1% |
| Other values (108417) | 197201 |
| Value | Count | Frequency (%) |
| 4.0858 | 1 | |
| 4.19124841 | 1 | |
| 4.3031 | 1 | |
| 4.372 | 1 | |
| 4.382 | 1 | |
| 4.385 | 1 | |
| 4.386 | 1 | |
| 4.387 | 1 | |
| 4.38824487 | 1 | |
| 4.391 | 1 |
| Value | Count | Frequency (%) |
| 4.82804068 | 1 | < 0.1% |
| 4.825 | 2 | < 0.1% |
| 4.82442932 | 1 | < 0.1% |
| 4.82375217 | 1 | < 0.1% |
| 4.82315459 | 1 | < 0.1% |
| 4.82192447 | 1 | < 0.1% |
| 4.821 | 1 | < 0.1% |
| 4.8208 | 1 | < 0.1% |
| 4.82076242 | 12 | |
| 4.82069488 | 1 | < 0.1% |
| Distinct | 107698 |
|---|---|
| Distinct (%) | 54.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -74.10409555 |
| Minimum | -74.2283 |
|---|---|
| Maximum | -74.011 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 199146 |
| Negative (%) | 100.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | -74.2283 |
|---|---|
| 5-th percentile | -74.17267833 |
| Q1 | -74.13419566 |
| median | -74.10304278 |
| Q3 | -74.07300744 |
| 95-th percentile | -74.04185926 |
| Maximum | -74.011 |
| Range | 0.2173 |
| Interquartile range (IQR) | 0.06118822 |
Descriptive statistics
| Standard deviation | 0.04009855411 |
|---|---|
| Coefficient of variation (CV) | -0.0005411111736 |
| Kurtosis | -0.6560662603 |
| Mean | -74.10409555 |
| Median Absolute Deviation (MAD) | 0.03050675 |
| Skewness | -0.1595583456 |
| Sum | -14757534.21 |
| Variance | 0.001607894042 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -74.103 | 254 | 0.1% |
| -74.138 | 244 | 0.1% |
| -74.112 | 221 | 0.1% |
| -74.139 | 216 | 0.1% |
| -74.15405306 | 211 | 0.1% |
| -74.08952915 | 200 | 0.1% |
| -74.079 | 195 | 0.1% |
| -74.1 | 189 | 0.1% |
| -74.084 | 184 | 0.1% |
| -74.11448906 | 179 | 0.1% |
| Other values (107688) | 197053 |
| Value | Count | Frequency (%) |
| -74.2283 | 1 | |
| -74.218 | 1 | |
| -74.2152414 | 1 | |
| -74.215 | 2 | |
| -74.21498272 | 1 | |
| -74.21495012 | 1 | |
| -74.21492074 | 1 | |
| -74.21477615 | 1 | |
| -74.2147 | 1 | |
| -74.2146 | 1 |
| Value | Count | Frequency (%) |
| -74.011 | 1 | |
| -74.013 | 2 | |
| -74.013247 | 1 | |
| -74.0139 | 1 | |
| -74.01391874 | 1 | |
| -74.01398528 | 1 | |
| -74.014 | 1 | |
| -74.01405288 | 1 | |
| -74.01447867 | 1 | |
| -74.01461162 | 1 |
| Distinct | 38419 |
|---|---|
| Distinct (%) | 19.5% |
| Missing | 1701 |
| Missing (%) | 0.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13798945.23 |
| Minimum | 0 |
|---|---|
| Maximum | 50009618 |
| Zeros | 883 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1004508 |
| Q1 | 7004943 |
| median | 10007581 |
| Q3 | 15001163 |
| 95-th percentile | 50007085 |
| Maximum | 50009618 |
| Range | 50009618 |
| Interquartile range (IQR) | 7996220 |
Descriptive statistics
| Standard deviation | 13467556.24 |
|---|---|
| Coefficient of variation (CV) | 0.9759844695 |
| Kurtosis | 2.637417592 |
| Mean | 13798945.23 |
| Median Absolute Deviation (MAD) | 3994125 |
| Skewness | 1.897446863 |
| Sum | 2.724532741 × 10¹² |
| Variance | 1.813750711 × 10¹⁴ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 883 | 0.4% |
| 10006560 | 324 | 0.2% |
| 8004259 | 318 | 0.2% |
| 13002027 | 308 | 0.2% |
| 9003941 | 287 | 0.1% |
| 10003679 | 249 | 0.1% |
| 10000256 | 229 | 0.1% |
| 8000071 | 228 | 0.1% |
| 10008661 | 226 | 0.1% |
| 8012646 | 220 | 0.1% |
| Other values (38409) | 194173 | |
| (Missing) | 1701 | 0.9% |
| Value | Count | Frequency (%) |
| 0 | 883 | |
| 1000001 | 9 | < 0.1% |
| 1000002 | 20 | < 0.1% |
| 1000003 | 18 | < 0.1% |
| 1000007 | 7 | < 0.1% |
| 1000009 | 2 | < 0.1% |
| 1000011 | 4 | < 0.1% |
| 1000012 | 4 | < 0.1% |
| 1000016 | 1 | < 0.1% |
| 1000021 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 50009618 | 1 | < 0.1% |
| 50009600 | 1 | < 0.1% |
| 50009598 | 3 | < 0.1% |
| 50009568 | 2 | < 0.1% |
| 50009554 | 4 | |
| 50009551 | 1 | < 0.1% |
| 50009550 | 1 | < 0.1% |
| 50009547 | 8 | |
| 50009526 | 1 | < 0.1% |
| 50009497 | 3 | < 0.1% |
| Distinct | 37953 |
|---|---|
| Distinct (%) | 23.5% |
| Missing | 37974 |
| Missing (%) | 19.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7588678.331 |
| Minimum | 0 |
|---|---|
| Maximum | 91030491 |
| Zeros | 47 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4384 |
| Q1 | 43267 |
| median | 176900 |
| Q3 | 235485 |
| 95-th percentile | 50015428 |
| Maximum | 91030491 |
| Range | 91030491 |
| Interquartile range (IQR) | 192218 |
Descriptive statistics
| Standard deviation | 18408766.92 |
|---|---|
| Coefficient of variation (CV) | 2.425819901 |
| Kurtosis | 3.648846114 |
| Mean | 7588678.331 |
| Median Absolute Deviation (MAD) | 87282 |
| Skewness | 2.224388789 |
| Sum | 1.223082464 × 10¹² |
| Variance | 3.388826994 × 10¹⁴ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 2570 | 1.3% |
| 1 | 571 | 0.3% |
| 170406 | 310 | 0.2% |
| 221312 | 210 | 0.1% |
| 50016858 | 184 | 0.1% |
| 193781 | 181 | 0.1% |
| 29830 | 170 | 0.1% |
| 50012516 | 168 | 0.1% |
| 140172 | 156 | 0.1% |
| 34687 | 153 | 0.1% |
| Other values (37943) | 156499 | |
| (Missing) | 37974 | 19.1% |
| Value | Count | Frequency (%) |
| 0 | 47 | < 0.1% |
| 1 | 571 | 0.3% |
| 3 | 2570 | |
| 25 | 1 | < 0.1% |
| 37 | 3 | < 0.1% |
| 38 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| 65 | 1 | < 0.1% |
| 70 | 1 | < 0.1% |
| 71 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 91030491 | 1 | < 0.1% |
| 91030442 | 3 | |
| 91030169 | 1 | < 0.1% |
| 91030166 | 1 | < 0.1% |
| 91029118 | 1 | < 0.1% |
| 91029105 | 1 | < 0.1% |
| 91029095 | 1 | < 0.1% |
| 91029093 | 1 | < 0.1% |
| 91029066 | 1 | < 0.1% |
| 91029059 | 1 | < 0.1% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
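The rank-based definition above can be sketched in plain Python. This is only an illustration under stated assumptions: the function names `average_ranks`, `pearson_r` and `spearman_rho` are hypothetical, ties receive average ranks (a common convention, not confirmed as the one used by this report), and the sample values are invented for the example.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values tied with values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    """Covariance divided by the product of standard deviations (1/n cancels)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def spearman_rho(xs, ys):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# Invented data: y = x**3 is monotonic but not linear in x.
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]
rho = spearman_rho(xs, ys)   # the ranks agree exactly
r = pearson_r(xs, ys)        # the raw values are not on a straight line
```

On this toy input the ranks of both lists are identical, so ρ reaches its maximum while r computed on the raw values stays below it, which is the distinction the paragraph draws.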
Pearson's r
Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
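As a minimal sketch of that calculation, the snippet below computes r by hand. The function name `pearson_r` is hypothetical, and the inputs are the X and LONGITUD values of the first five rows shown in this report, used purely as a small worked example rather than as a reproduction of the report's own computation.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equally long sequences with at least 2 points")
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    # The 1/n factors of covariance and variances cancel in the ratio.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("r is undefined when one variable is constant")
    return cov / sqrt(var_x * var_y)


# X and LONGITUD from the first five rows of the dataset preview.
x_col = [-74.090924, -74.121000, -74.042000, -74.166937, -74.092901]
longitud = [-74.090924, -74.121000, -74.042000, -74.166937, -74.092901]

r_same = pearson_r(x_col, longitud)

# Shifting and (positively) rescaling one variable leaves r unchanged.
r_rescaled = pearson_r(x_col, [2 * v + 3 for v in longitud])
```

Because the two columns carry identical values in these rows, r comes out at essentially 1, consistent with the "X is highly correlated with LONGITUD" warning; the rescaled call illustrates the location and scale invariance.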
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations; τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
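The pair-counting rule can be sketched directly. The snippet implements the formula exactly as stated (often called τ-a); libraries commonly apply a tie-adjusted variant instead, and which variant this report uses is not stated here, so treat the function name `kendall_tau_a` and the sample data as illustrative assumptions.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equally long sequences with at least 2 points")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1      # both variables move the same way
        elif direction < 0:
            discordant += 1      # the variables move in opposite ways
        # direction == 0 means a tie in x or y; it counts toward neither group
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Invented data: one swapped pair out of six possible pairs.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
```

With four observations there are six pairs; five are concordant and one is discordant, so τ is (5 - 1) / 6.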
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently across categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient for a bivariate normal input distribution. Extensive documentation is available.
First rows
| | X | Y | OBJECTID | FORMULARIO | CODIGO_ACCIDENTE | FECHA_OCURRENCIA_ACC | ANO_OCURRENCIA_ACC | DIRECCION | GRAVEDAD | CLASE_ACC | LOCALIDAD | FECHA_HORA_ACC | LATITUD | LONGITUD | CIV | PK_CALZADA |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | -74.090924 | 4.693807 | 1 | A000640275 | 4484660 | 2017/06/12 00:00:00+00 | 2017 | AV AVENIDA BOYACA-CL 79 02 | SOLO DANOS | CHOQUE | ENGATIVA | 2017/06/12 05:30:00+00 | 4.693807 | -74.090924 | 10006772.0 | 221236.0 |
| 1 | -74.121000 | 4.603000 | 2 | A001233353 | 10533499 | 2020/11/19 00:00:00+00 | 2020 | CL 26 S- KR 50 02 | CON HERIDOS | OTRO | PUENTE ARANDA | 2020/11/19 02:05:00+00 | 4.603000 | -74.121000 | 16004560.0 | NaN |
| 2 | -74.042000 | 4.682000 | 4 | A001232786 | 10533629 | 2020/11/10 00:00:00+00 | 2020 | KR 9 - CL 100 02 | SOLO DANOS | CHOQUE | USAQUEN | 2020/11/10 13:30:00+00 | 4.682000 | -74.042000 | 30001107.0 | NaN |
| 3 | -74.166937 | 4.587187 | 7 | A000200705 | 4412699 | 2015/05/11 00:00:00+00 | 2015 | CL 63A-KR 72 S 02 | SOLO DANOS | CHOQUE | CIUDAD BOLIVAR | 2015/05/11 10:50:00+00 | 4.587187 | -74.166937 | 19001483.0 | 136166.0 |
| 4 | -74.092901 | 4.607648 | 8 | A000402862 | 4447845 | 2016/06/08 00:00:00+00 | 2016 | KR 27-CL 9 14 | SOLO DANOS | CHOQUE | LOS MARTIRES | 2016/06/08 21:30:00+00 | 4.607648 | -74.092901 | 14000548.0 | 239719.0 |
| 5 | -74.042000 | 4.778000 | 9 | A001179874 | 10533587 | 2020/08/03 00:00:00+00 | 2020 | AU NORTE - CL 200 02 | CON MUERTOS | ATROPELLO | SUBA | 2020/08/03 14:05:00+00 | 4.778000 | -74.042000 | 1006455.0 | NaN |
| 6 | -74.055853 | 4.724626 | 10 | A000240105 | 4424883 | 2015/09/26 00:00:00+00 | 2015 | KR 52A-CL 137A 35 | SOLO DANOS | CHOQUE | SUBA | 2015/09/26 18:00:00+00 | 4.724626 | -74.055853 | 11008301.0 | 31431.0 |
| 7 | -74.039000 | 4.796000 | 12 | A001233064 | 10533503 | 2020/11/23 00:00:00+00 | 2020 | AU NORTE - CL 220 02 | SOLO DANOS | OTRO | USAQUEN | 2020/11/23 11:50:00+00 | 4.796000 | -74.039000 | NaN | NaN |
| 8 | -74.110585 | 4.693578 | 13 | A000551010 | 4468708 | 2016/12/27 00:00:00+00 | 2016 | CL 69A-KR 89A 02 | CON HERIDOS | CHOQUE | ENGATIVA | 2016/12/27 19:00:00+00 | 4.693578 | -74.110585 | 10006813.0 | 219627.0 |
| 9 | -74.135766 | 4.659330 | 15 | A000686495 | 4495519 | 2017/10/02 00:00:00+00 | 2017 | AV AVENIDA CIUDAD DE CALI-KR 17 2 | CON HERIDOS | CHOQUE | FONTIBON | 2017/10/02 09:20:00+00 | 4.659330 | -74.135766 | 50008531.0 | 272348.0 |
Last rows
| | X | Y | OBJECTID | FORMULARIO | CODIGO_ACCIDENTE | FECHA_OCURRENCIA_ACC | ANO_OCURRENCIA_ACC | DIRECCION | GRAVEDAD | CLASE_ACC | LOCALIDAD | FECHA_HORA_ACC | LATITUD | LONGITUD | CIV | PK_CALZADA |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 199136 | -74.171000 | 4.628000 | 421902 | A001298808 | 10540139 | 2021/05/16 00:00:00+00 | 2021 | KR 86 - AV AVENIDA CIUDAD DE VILLAVICENCIO 02 | SOLO DANOS | CHOQUE | KENNEDY | 2021/05/16 23:30:00+00 | 4.628000 | -74.171000 | 8005072.0 | NaN |
| 199137 | -74.137864 | 4.626886 | 421903 | A001301685 | 10542512 | 2021/06/20 00:00:00+00 | 2021 | AV AVENIDA BOYACA - CL 3 B 18 | CON HERIDOS | CHOQUE | KENNEDY | 2021/06/20 06:15:00+00 | 4.626886 | -74.137864 | 8005344.0 | NaN |
| 199138 | -74.140587 | 4.615627 | 421904 | A001241254 | 10534370 | 2021/02/28 00:00:00+00 | 2021 | KR 72 - CL 35 S 79 | CON HERIDOS | VOLCAMIENTO | KENNEDY | 2021/02/28 01:50:00+00 | 4.615627 | -74.140587 | 8007974.0 | NaN |
| 199139 | -74.158000 | 4.628000 | 421905 | A001339673 | 10547144 | 2021/08/14 00:00:00+00 | 2021 | KR 80 - CL 38 S 02 | SOLO DANOS | CHOQUE | KENNEDY | 2021/08/14 15:53:00+00 | 4.628000 | -74.158000 | 50006497.0 | NaN |
| 199140 | -74.141000 | 4.610000 | 421906 | A001340014 | 10547398 | 2021/08/20 00:00:00+00 | 2021 | KR 72 - CL 38 S 02 | SOLO DANOS | CHOQUE | KENNEDY | 2021/08/20 01:15:00+00 | 4.610000 | -74.141000 | 8009540.0 | NaN |
| 199141 | -74.160000 | 4.637000 | 421907 | A001341297 | 10548522 | 2021/08/30 00:00:00+00 | 2021 | KR 86 F - CL 33 S 02 | SOLO DANOS | CHOQUE | KENNEDY | 2021/08/30 16:31:00+00 | 4.637000 | -74.160000 | 8003090.0 | NaN |
| 199142 | -74.167000 | 4.628000 | 421908 | A001305748 | 10546116 | 2021/08/03 00:00:00+00 | 2021 | CL 42 B S- KR 81 L 02 | CON HERIDOS | ATROPELLO | KENNEDY | 2021/08/03 14:00:00+00 | 4.628000 | -74.167000 | 8005066.0 | NaN |
| 199143 | -74.158247 | 4.624830 | 421909 | A001238302 | 10536074 | 2021/03/19 00:00:00+00 | 2021 | DG 2 S- KR 79 12 | CON HERIDOS | CHOQUE | KENNEDY | 2021/03/19 12:50:00+00 | 4.624830 | -74.158247 | 8005839.0 | NaN |
| 199144 | -74.167000 | 4.622000 | 421910 | A001297106 | 10538181 | 2021/04/18 00:00:00+00 | 2021 | CL 43 S- KR 80 02 | CON HERIDOS | CHOQUE | KENNEDY | 2021/04/18 21:21:00+00 | 4.622000 | -74.167000 | 8011660.0 | NaN |
| 199145 | -74.168000 | 4.630000 | 421911 | A001304271 | 10544226 | 2021/07/12 00:00:00+00 | 2021 | AV AVENIDA CIUDAD DE CALI - CL 42 S 02 | CON HERIDOS | CAIDA DE OCUPANTE | KENNEDY | 2021/07/12 19:53:00+00 | 4.630000 | -74.168000 | 8004623.0 | NaN |